Supplemental material for “ Model - based influences on humans ’ choices and striatal prediction errors ”
نویسندگان
چکیده
The task consists of three states (first stage: sA; second stage: sB and sC), each with two actions (aA and aB). The goal of both the model-based and model-free subcomponents of the algorithm is to learn a state-action value function Q(s,a) mapping each state-action pair to its expected future value. On trial t, we denote the first-stage state (always sA) by s1,t, the second-stage state by s2,t, the firstand secondstage actions by a1,t and a2,t , and the firstand second-stage rewards as r1,t (always zero) and r2,t.
منابع مشابه
Model-based influences on humans’ choices and striatal prediction errors
The mesostriatal dopamine system is prominently implicated in model-free reinforcement learning, with fMRI BOLD signals in ventral striatum notably covarying with model-free prediction errors. However, latent learning and devaluation studies show that behavior also shows hallmarks of model-based planning, and the interaction between model-based and model-free values, prediction errors, and pref...
متن کاملDevelopment of Lifetime Prediction Model of Lithium-Ion Battery Based on Minimizing Prediction Errors of Cycling and Operational Time Degradation Using Genetic Algorithm
Accurate lifetime prediction of lithium-ion batteries is a great challenge for the researchers and engineers involved in battery applications in electric vehicles and satellites. In this study, a semi-empirical model is introduced to predict the capacity loss of lithium-ion batteries as a function of charge and discharge cycles, operational time, and temperature. The model parameters are obtai...
متن کاملInformation Seeking as Chasing Anticipated Prediction Errors
When faced with delayed, uncertain rewards, humans and other animals usually prefer to know the eventual outcomes in advance. This preference for cues providing advance information can lead to seemingly suboptimal choices, where less reward is preferred over more reward. Here, we introduce a reinforcement-learning model of this behavior, the anticipated prediction error (APE) model, based on th...
متن کاملA New High-order Takagi-Sugeno Fuzzy Model Based on Deformed Linear Models
Amongst possible choices for identifying complicated processes for prediction, simulation, and approximation applications, high-order Takagi-Sugeno (TS) fuzzy models are fitting tools. Although they can construct models with rather high complexity, they are not as interpretable as first-order TS fuzzy models. In this paper, we first propose to use Deformed Linear Models (DLMs) in consequence pa...
متن کاملStriatal prediction errors support dynamic control of declarative memory decisions
Adaptive memory requires context-dependent control over how information is retrieved, evaluated and used to guide action, yet the signals that drive adjustments to memory decisions remain unknown. Here we show that prediction errors (PEs) coded by the striatum support control over memory decisions. Human participants completed a recognition memory test that incorporated biased feedback to influ...
متن کامل